Picture for Mingkun Yang

Mingkun Yang

SMoFi: Step-wise Momentum Fusion for Split Federated Learning on Heterogeneous Data

Add code
Nov 16, 2025
Figure 1 for SMoFi: Step-wise Momentum Fusion for Split Federated Learning on Heterogeneous Data
Figure 2 for SMoFi: Step-wise Momentum Fusion for Split Federated Learning on Heterogeneous Data
Figure 3 for SMoFi: Step-wise Momentum Fusion for Split Federated Learning on Heterogeneous Data
Figure 4 for SMoFi: Step-wise Momentum Fusion for Split Federated Learning on Heterogeneous Data
Viaarxiv icon

Qwen2.5-VL Technical Report

Add code
Feb 19, 2025
Figure 1 for Qwen2.5-VL Technical Report
Figure 2 for Qwen2.5-VL Technical Report
Figure 3 for Qwen2.5-VL Technical Report
Figure 4 for Qwen2.5-VL Technical Report
Viaarxiv icon

CC-OCR: A Comprehensive and Challenging OCR Benchmark for Evaluating Large Multimodal Models in Literacy

Add code
Dec 03, 2024
Viaarxiv icon

Sequential Visual and Semantic Consistency for Semi-supervised Text Recognition

Add code
Feb 24, 2024
Viaarxiv icon

Class-Aware Mask-Guided Feature Refinement for Scene Text Recognition

Add code
Feb 21, 2024
Viaarxiv icon

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Add code
May 12, 2023
Figure 1 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 2 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 3 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Figure 4 for Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution
Viaarxiv icon

Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition

Add code
Jul 01, 2022
Figure 1 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 2 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 3 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Figure 4 for Reading and Writing: Discriminative and Generative Modeling for Self-Supervised Text Recognition
Viaarxiv icon

DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry

Add code
May 20, 2021
Figure 1 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 2 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 3 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Figure 4 for DeepAVO: Efficient Pose Refining with Feature Distilling for Deep Visual Odometry
Viaarxiv icon

Scene Text Retrieval via Joint Text Detection and Similarity Learning

Add code
Apr 04, 2021
Figure 1 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 2 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 3 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Figure 4 for Scene Text Retrieval via Joint Text Detection and Similarity Learning
Viaarxiv icon

Efficient Backbone Search for Scene Text Recognition

Add code
Mar 14, 2020
Figure 1 for Efficient Backbone Search for Scene Text Recognition
Figure 2 for Efficient Backbone Search for Scene Text Recognition
Figure 3 for Efficient Backbone Search for Scene Text Recognition
Figure 4 for Efficient Backbone Search for Scene Text Recognition
Viaarxiv icon